AITopics

2405.00026

Country:

North America > United States > Arizona > Coconino County > Flagstaff (0.05)
North America > United States > Ohio > Butler County > Oxford (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
(3 more...)

Genre: Research Report > New Finding (0.69)

Industry:

Law Enforcement & Public Safety > Fraud (1.00)
Banking & Finance (1.00)

Technology:

Information Technology > e-Commerce > Financial Technology (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)

arXiv.org Artificial IntelligenceJan-1-2024

Downstream Task-Oriented Generative Model Selections on Synthetic Data Training for Fraud Detection Models

Cheng, Yinan, Wang, Chi-Hua, Potluru, Vamsi K., Balch, Tucker, Cheng, Guang

Devising procedures for downstream task-oriented generative model selections is an unresolved problem of practical importance. Existing studies focused on the utility of a single family of generative models. They provided limited insights on how synthetic data practitioners select the best family generative models for synthetic training tasks given a specific combination of machine learning model class and performance metric. In this paper, we approach the downstream task-oriented generative model selections problem in the case of training fraud detection models and investigate the best practice given different combinations of model interpretability and model performance constraints. Our investigation supports that, while both Neural Network(NN)-based and Bayesian Network(BN)-based generative models are both good to complete synthetic training task under loose model interpretability constrain, the BN-based generative models is better than NN-based when synthetic training fraud detection model under strict model interpretability constrain. Our results provides practical guidance for machine learning practitioner who is interested in replacing their training dataset from real to synthetic, and shed lights on more general downstream task-oriented generative model selection problems.

dataset, fraud detection model, generative model, (11 more...)

2401.00974

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > New Finding (0.89)

Industry: Law Enforcement & Public Safety > Fraud (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Generation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.98)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.54)

Hsieh, Din-Yin, Wang, Chi-Hua, Cheng, Guang

Improve Fidelity and Utility of Synthetic Credit Card Transaction Time Series from Data-centric Perspective

arXiv.org Artificial IntelligenceJan-1-2024

Exploring generative model training for synthetic tabular data, specifically in sequential contexts such as credit card transaction data, presents significant challenges. This paper addresses these challenges, focusing on attaining both high fidelity to actual data and optimal utility for machine learning tasks. We introduce five pre-processing schemas to enhance the training of the Conditional Probabilistic Auto-Regressive Model (CPAR), demonstrating incremental improvements in the synthetic data's fidelity and utility. Upon achieving satisfactory fidelity levels, our attention shifts to training fraud detection models tailored for time-series data, evaluating the utility of the synthetic data. Our findings offer valuable insights and practical guidelines for synthetic data practitioners in the finance sector, transitioning from real to synthetic datasets for training purposes, and illuminating broader methodologies for synthesizing credit card transaction time series.

cpar model, dataset, synthetic data, (11 more...)

2401.00965

Country: North America > United States > California (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Banking & Finance > Credit (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.56)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Campo, Bavo D. C., Antonio, Katrien

An engine to simulate insurance fraud network data

arXiv.org Artificial IntelligenceAug-21-2023

Traditionally, the detection of fraudulent insurance claims relies on business rules and expert judgement which makes it a time-consuming and expensive process (\'Oskarsd\'ottir et al., 2022). Consequently, researchers have been examining ways to develop efficient and accurate analytic strategies to flag suspicious claims. Feeding learning methods with features engineered from the social network of parties involved in a claim is a particularly promising strategy (see for example Van Vlasselaer et al. (2016); Tumminello et al. (2023)). When developing a fraud detection model, however, we are confronted with several challenges. The uncommon nature of fraud, for example, creates a high class imbalance which complicates the development of well performing analytic classification models. In addition, only a small number of claims are investigated and get a label, which results in a large corpus of unlabeled data. Yet another challenge is the lack of publicly available data. This hinders not only the development of new methods, but also the validation of existing techniques. We therefore design a simulation machine that is engineered to create synthetic data with a network structure and available covariates similar to the real life insurance fraud data set analyzed in \'Oskarsd\'ottir et al. (2022). Further, the user has control over several data-generating mechanisms. We can specify the total number of policyholders and parties, the desired level of imbalance and the (effect size of the) features in the fraud generating model. As such, the simulation engine enables researchers and practitioners to examine several methodological challenges as well as to test their (development strategy of) insurance fraud detection models in a range of different settings. Moreover, large synthetic data sets can be generated to evaluate the predictive performance of (advanced) machine learning techniques.

artificial intelligence, fraudulent claim, machine learning, (13 more...)

2308.11659

Country:

Europe > Belgium > Flanders > Flemish Brabant > Leuven (0.04)
North America > United States > New York (0.04)
North America > United States > New Jersey > Hudson County > Hoboken (0.04)
(8 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.67)

Industry:

Law Enforcement & Public Safety > Fraud (1.00)
Information Technology (1.00)
Banking & Finance > Insurance (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.34)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.34)

#artificialintelligenceMar-10-2023, 21:25:38 GMT

Data Scientist (Detection) at Ravelin - London, England, United Kingdom - Remote

We're a fraud detection company using advanced machine learning and network analysis technology to solve big problems. Our goal is to make online transactions safer and help our clients feel confident serving their customers. And we have fun in the meantime! We are a friendly bunch and pride ourselves in having a strong culture and adhering to our values of empathy, ambition, unity and integrity. We really value work/life balance and we embrace a flat hierarchy structure company-wide.

data scientist, detection team, ravelin, (6 more...)

Country: Europe > United Kingdom > England > Greater London > London (0.40)

Industry: Law Enforcement & Public Safety > Fraud (0.62)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.56)

#artificialintelligenceSep-20-2022, 07:30:15 GMT

CONFUSION MATRIX

Accuracy: Of all the classes, how many you predicted right. Accuracy is simply the fraction of the total sample that is correctly identified. Precision: Out of all the classes we have predicted as positive, how many are actually positive. Precision is very useful when you have a model that starts some kind of business workflow (e.g. So, you want your model to be as correct as possible when it says 1 and don't care too much when it predicts 0. That's why we see only the second column of the confusion matrix, which is related to a prediction equal to 1. Precision is very used in marketing campaigns, because a marketing automation campaign is supposed to start an activity on a user when it predicts that they will respond successfully.

artificial intelligence, machine learning, threshold value, (12 more...)

Industry: Marketing (0.57)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

#artificialintelligenceAug-31-2022, 18:35:18 GMT

4 Reasons Why Companies are Using AutoML

The meager supply and high salaries of data scientists have led to a decision among many companies totally in keeping with artificial intelligence ― to automate whatever is possible. Case in point is machine learning. A Forrester study found that automated machine learning (AutoML) has been adopted by 61% of data and analytics decision makers in companies using AI, with another 25% of companies saying they'll do so in the next year. Automated machine learning (AutoML) automates repetitive and manual machine learning tasks. That's no small thing, especially when data scientists and data analysts now spend a majority of their time cleaning, sourcing, and preparing data.

automl, data scientist, validated design, (14 more...)

Genre: Research Report > New Finding (0.35)

Industry: Information Technology (0.72)

Technology:

Information Technology > Data Science > Data Mining > Big Data (0.35)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.32)

#artificialintelligenceJun-3-2022, 12:25:43 GMT

How AmEx used its credit fraud AI to start a banking product

When credit card giant American Express began offering bank accounts for the first time last year, it had a foundation of fraud detection to bring to an entirely new product arena. That meant in some cases, the company could port over AI and machine-learning models used to spot phony identities or dodgy transactions for its credit card products to its consumer and business checking accounts. But it's been a process, and now, AmEx plans to invest in bringing additional AI techniques used to protect against credit card fraud to its banking products. "We have models which run to detect whether it's you or whether somebody else is logging into your account. Very straightforwardly, we transferred it to the banking product," said Abhinav Jain, vice president for Global Fraud Decision Science at AmEx, who is responsible for the company's fraud detection models.

banking product, customer, transaction, (13 more...)

Country:

North America > United States (0.31)
Europe > Ukraine (0.06)

Industry:

Law Enforcement & Public Safety > Fraud (1.00)
Banking & Finance (1.00)
Government > Regional Government > North America Government > United States Government (0.31)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.31)

#artificialintelligenceAug-12-2020, 16:20:12 GMT

Fraud Detection with Machine Learning

Fraud is one of the major issues we come up majorly in banks, life insurance, health insurance, and many others. These major frauds are dependent on the person who is trying to sell you the fake product or service, if you are matured enough to decide what is wrong then you will never get into any fraud transactions. But one such fraud that has been increasing a lot these days is fraud in making payments. In this article, I will take you through a solution to fraud detection with machine learning. The dataset that I will use for this task can be easily downloaded from here.

artificial intelligence, machine learning, transaction, (10 more...)

Industry:

Law Enforcement & Public Safety > Fraud (0.84)
Banking & Finance > Insurance (0.58)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

#artificialintelligenceMay-25-2020, 04:34:27 GMT

China's State News Agency Introduces New Artificial Intelligence Anchor

The traditional method of training AI models involves setting up servers where models are trained on data, often through the use of a cloud-based computing platform. However, over the past few years an alternative form of model creation has arisen, called federated learning. Federated learning brings machine learning models to the data source, rather than bringing the data to the model. Federated learning links together multiple computational devices into a decentralized system that allows the individual devices that collect data to assist in training the model. In a federated learning system, the various devices that are part of the learning network each have a copy of the model on the device.

artificial intelligence, federated, machine learning, (13 more...)

Country: Asia > China (0.40)

Industry:

Media > News (0.40)
Information Technology > Services (0.35)
Education (0.31)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)